Picture for Xia Hu

Xia Hu

PrivacyPeek: Auditing What LLM-Based Agents Acquire, Not Just What They Say

Add code
May 29, 2026
Viaarxiv icon

COLLEAGUE.SKILL: Automated AI Skill Generation via Expert Knowledge Distillation

Add code
May 29, 2026
Viaarxiv icon

AgentDoG 1.5: A Lightweight and Scalable Alignment Framework for AI Agent Safety and Security

Add code
May 28, 2026
Viaarxiv icon

AgentSchool: An LLM-Powered Multi-Agent Simulation for Education

Add code
May 28, 2026
Viaarxiv icon

Understanding Generalization through Decision Pattern Shift

Add code
May 13, 2026
Viaarxiv icon

SkillSafetyBench: Evaluating Agent Safety under Skill-Facing Attack Surfaces

Add code
May 12, 2026
Viaarxiv icon

The Cartesian Shortcut: Re-evaluate Vision Reasoning in Polar Coordinate Space

Add code
May 11, 2026
Viaarxiv icon

Safactory: A Scalable Agent Factory for Trustworthy Autonomous Intelligence

Add code
May 07, 2026
Viaarxiv icon

Benchmarks for Trajectory Safety Evaluation and Diagnosis in OpenClaw and Codex: ATBench-Claw and ATBench-CodeX

Add code
Apr 16, 2026
Viaarxiv icon

ATBench: A Diverse and Realistic Agent Trajectory Benchmark for Safety Evaluation and Diagnosis

Add code
Apr 08, 2026
Viaarxiv icon